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Abstract — Distributed configuration management is imperative 
for wireless infrastructureless networks where eacli node adjusts 
locally its physical and logical configuration through information 
exchange with neighbors. Many algorithms have been developed 
with promising results in this area. However, two issues remain 
open. The first is the optimality, i.e., whether a distributed algo- 
rithm results in a near-optimal network configuration. The second 
is the complexity, i.e., whether a distributed algorithm scales 
gracefully with network size (N). We study these issues through 
modeling, analysis, and randomized distributed algorithms. 

Modeling defines the optimality. We first derive a global 
probabilistic model for a network configuration which charac- 
terizes jointly the statistical spatial dependence of a physical- 
and a logical-configuration. The model is a Gibbs distribution 
that results from internal network properties on node positions, 
wireless channels and interference; and external management 
constraints on physical connectivity, signal quality and configu- 
ration costs. We then show that a local model which approximates 
the global model is a two-layer Markov Random Field or a 
random bond model. The complexity of the local model is the 
communication range among nodes. The local model is near- 
optimal when the approximation error to the global model is 
within a given error bound. We analyze the trade-off between 
an approximation error and complexity, and derive sufficient 
conditions on the near-optimality of the local model. We show that 
when the power attenuation of a wireless channel is larger than 
4, a node needs to communicate with more than 0(1) neighbors 
for a local model to be near optimal. For a slowly decaying 
channel with power attenuation less than 4, a node may need to 
communicate with more than 0{-VN) neighbors to result in a 
bounded approximation error. The two-layer Markov Random 
Fields enable a class of randomized distributed algorithms that 
allow a node to self-configure based on information from its 
neighbors. The distributed algorithms are applied to examples 
of (a) forming a 1-connected physical topology, (b) configuring 
a logical topology that maximizes the spatial channel reuse, and 
(c) reconfiguring from failures, both sequentially and jointly. We 
validate the model, the analysis and the randomized distributed 
algorithms also through simulation. 



I. Introduction 

Wireless infrastructureless networks include sensor- and 
actor-networks, wireless mesh networks, and agent networks. 
Such networks are composed of a physical and a logical con- 
figuration (topology). A physical configuration is characterized 
by node positions and connectivity. A logical configuration is 
characterized by link activities, i.e., a pattern of node-node 
communications on who is communicating with whom and 
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when. For wireless networks, both a physical and a logical 
configuration can vary due to either failures or environmental 
changes. Configuration management is to adapt a physical 
and/or a logical topology to support and maintain node-node 
communications. 

There is no centralized management authority for infrastruc- 
tureless wireless networks. Self-configuration is desirable for 
nodes to adjust adaptively their own positions and communica- 
tion activities in a distributed setting through local interactions. 
A challenge is whether distributed self-configuration would 
result in a near-optimal configuration with a sufficiently small 
approximation error 

In this work, we develop an analytical model for distributed 
self-configuration, and study the issue of near-optimality. To 
be specific, we consider joint formation and re-configuration 
of a physical topology and scheduling upon node failures. For 
simplicity, we focus on ad-hoc wireless networks with node 
failures and random perturbations of positions but without 
mobility. Our prior work |13 | developed an initial probabilistic 
graphical model for the distributed configuration. This work 
enhances modeling, analysis on the performance-complexity 
trade-off, and validation through simulation. 

What is optimality? Deterministic optimization has been 
used to obtain an optimal solution for centralized management 
El El E] 1231 . A cost function is derived to consist of manage- 
ment objectives and constraints. Such an approach has been 
applied to network capacity maximization through adapting a 
physical topology jSl, and to link-scheduling through config- 
uring a logical topology fT\. Open issues are how to include 
random factors such as inaccurate node positions, wireless 
channels, interference, and locally interacting wireless nodes. 
Another open issue is whether a cost function itself is optimal, 
which may go beyond a conventional optimization framework. 

From a computational standpoint, global optimization re- 
quires a centralized entity to maintain and update complete 
information for all nodes in the network. This is impractical 
for large networks. More importantly, locality can be a generic 
feature to configuration management. For example, in a large 
wireless network, nodes and links often fail locally. A local 
repair is thus desirable for preventing an entire network from 
an incessant re-configuration. Therefore, distributed configu- 
ration management is a necessity for large wireless networks 

mini. 

In a distributed setting, each node either adjusts its own 
physical position or decides when and whom to transmit 
based on local information. This should be done in a fully 
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asynchronous and distributed fashion with local information 
exchange. Local information on a configuration may include 
physical locations and communication activities (channel- 
access) of neighbors. Such information can be either sensed 
locally at a node or exchanged with neighbors. The range 
of information-exchange characterizes communication com- 
plexity. When the information-exchange is performed only 
among close neighbors, the resulting distributed algorithm 
would scale gracefully with a network size. 

Numerous distributed algorithms and protocols have been 
developed for topology formation Il28lll30l using local in- 
formation (see |[33l and references therein). Self-organizing 
protocols have been developed for sensor networks ll5l ll26l 
and p2p self-stabilizing networks (see Usl and references 
therein). These distributed algorithms are deterministic. 1251 
uses a simple randomized algorithm to analyze throughput for 
different traffic conditions of wireless ad-hoc networks. |29] 
develops a fast randomized and distributed algorithm for edge 
coloring, assuming a bounded nodal degree on bipartite graphs 
and thus Markovian dependence. [ 1 1 develops a probabilistic 
model with spatial Markovian assumptions to characterize the 
randomness in node positions. Il24l uses gossiping algorithms 
to show that the throughput of wireless networks can be 
maximized in a fully distributed fashion. These randomized 
algorithms either assume Markovian spatial dependence which 
results in the optimality or require non-local information for 
local-decisions. |4| |23| show that more general SINR-based 
interference models result in non-Markovian (long-range) spa- 
tial dependence, and the aggregated interference from small 
interfering links of different strengths may not be negligible 
El- lfT6l proposes a measurement-based self-organization for 
802.11 wireless access network. |16| is based on a Gibbs 
distribution, thus the global optimality can be achieved asymp- 
totically as shown in Q. However, different from 802.11 
wireless access networks, nodes in ad hoc wireless networks 
may not have complete information especially from far-away 
nodes. 

A question arises |24|, if the spatial dependence is non- 
Markovian, when is distributed configuration management 
near optimal? Thus, quantitative conditions need to be derived 
on when and how the distributed management can result in a 
near-optimal configuration. In fact, it has been considered as a 
difficult problem in general to develop a distributed algorithm 
with a predictable performance BTI . The open questions are 

o What measures the optimality of centralized-configuration 
management, and the near-optimality of distributed- 
configuration management? 

o When is it possible for distributed management to achieve 
a near-optimal configuration? 

o How to derive distributed algorithms that obtain a near- 
optimal configuration? 

This work intends to develop a framework to study these 
issues through modeling, analysis, and algorithms. 

Global Model and Optimality: The optimality of con- 
figuration management can be considered in a model-based 
framework. There, if network configurations can be modeled 
based on the ground truth, the optimality can be defined 



accordingly. Centralized configuration management can be 
used to define the optimality of a network configuration as 
nodes are not limited by the scope of information exchange. 

We consider three factors in a model of configuration: 
Randomness resulting from a network internally, management 
constraints imposed externally, and distributed decisions made 
by nodes with limited information. Randomness in wireless 
networks is challenging to model fTsl. This work begins with 
simple scenarios where the randomness results from perturbed 
node positions, interference, and node-node communication. 
Fading is not considered in this work for simplicity. Man- 
agement constraints include requirements on the connectivity 
of a physical topology, signal-to-interference plus noise ratio 
(SINR) and reconfiguration cost. Nodes make asynchronous 
and randomized decisions, adjusting a configuration in a 
distributed setting. The model is developed from bottom-up 
by mapping these three factors onto a probability distribution. 
The model is thus accurate in regard to the "ground truth" 
based on the assumed wireless channel, constraints and nodal 
decisions. Overall, such a model characterizes statistical spa- 
tial dependence of nodes/links in a network configuration. 

To obtain an analytical form of the probabilistic model, 
we adopt an analogy between link activities and node 
positions of a wireless network and interacting particles 
in statistical physics lfT2llf32l . Such an analogy allows the 
use of "configuration Hamiltonian" |fT9l for quantifying 
a network configuration. The configuration Hamiltonian 
corresponds to an artificial system energy of a wireless 
network. The system energy combines the physical topology, 
link activities, and management constraints into a single 
quantity. The configuration Hamiltonian is then used to obtain 
a probabilistic model which is known as a Gibbs distribution 
|12|. Such a Gibbs distribution is for an entire "network" 
configuration and thus corresponds to a global probabilistic 
model. 

Local Model and Probabilistic Graphs: We obtain a local 
model as an approximation of the global model of network 
configurations. To approximate the global model, we rely 
on a one-to-one mapping between a Gibbs distribution and 
a probabilistic graphical model in machine learning ifTl lfTsl . 
The graph provides a simple and explicit representation of 
statistical spatial dependence in a network configuration. 

We show that a probabilistic graph of the global model 
belongs to a two-layer random-field. One layer is for a physical 
configuration, and the other layer is for a logical configuration. 
The graph is fully connected , i.e., non-Markovian, where the 
long-range spatial dependence results from the interference 
among far-away nodes. When the long-range interference 
can be neglected, the global model can be approximated by 
a two-layer coupled Markov Random Field which is also 
called a Random Bond model. The corresponding dependency 
graph exhibits a nested spatial Markov dependence for both 
a physical and logical configuration. Mathematically, such a 
spatial Markov dependence can be represented as a product 
of local conditional probability density functions Q. Hence 
the probabilistic graphical model shows which "dependency 
links" to remove, result in a local model. 
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We define an approximation error to measure the difference 
between the local and global model. When the approximation 
error is within a given bound, a local model is near-optimal. 
We then define complexity to characterize the size of a 
neighborhood where nodes exchange information locally. We 
obtain bounds for the approximation error. The approximation 
error and the bounds depend on the following parameters from 
the physical and logical configuration: the power decay of a 
wireless channel, the density of active (transmitting) nodes, the 
complexity, and the size of a network. We quantify the impact 
of these interacting parameters on the approximation error for 
a large network. We derive sufficient conditions for the local 
model to be near-optimal, i.e., when the total interference 
from far-away nodes decreases faster than the growth of 
the number of interfering nodes as the network size increases. 



allocation that is implemented with local interactions among 
neighbors. 

Configuration Management: Let XiQ and Xj be a desired 
and an actual location of node i, for 1 < i < N. X = 
{Xi, ■ ■ ■ ,Xjv} are random positions of nodes in a network 
where the randomness results from perturbed positions, ran- 
dom movements and measurement errors. 

Let (Tij denote channel-access of link (i, j), where aij = 1 
if node i is transmitting to node j; and cry = — 1, otherwise. 
(T,j is referred to as a "communication dipole" in this work, 
for 1 < i,j < N, i ^ j, and cr = {a^ - ■ ■ jCj^ denotes 
a set of link activities in the network. Link activities are 
assumed to be random as they are triggered by network-layer 
random traffic demands. A logical configuration is cr = {(Jij}. 
A network configuration is {cr,X). 



Distributed Algorithm; A local model results in a 
class of distributed algorithms where nodes self-configure 
through information exchange with neighbors. The range of 
information exchange characterizes the local connectivity 
of the probabilistic dependency graph, and corresponds 
to the communication complexity. The actual information 
exchanged includes relative positions of neighbors and 
activities of adjacent nodes. Node decisions are probabilistic, 
corresponding to randomized distributed algorithms of 
graphical models. 

We apply the distributed algorithm to three examples of self- 
configuration: (a) forming a 1 -connected physical topology 
from a random initial topology; (b) configuring a logical 
topology that maximizes spatial channel-reuse and inces- 
sant communication demands; and (c) reconfiguring jointly 
a physical- and logical-configuration upon failures. We use 
simulation to validate the models and bounds, and demonstrate 
how the framework enables fully distributed spatial scheduling 
algorithms and fault tolerance for wireless infrastructureless 
networks. 

II. Problem Formulation 

A. Assumptions 

Consider a wireless network with the following assump- 
tions. 

Physical Layer: All nodes share a common frequency 
channel. A pair of nodes within a communication range can 
communicate directly with an omni-directional antenna. The 
wireless channel follows a path-loss model with a power 
attenuation of factor a E {2^6}. Shadowing and/or multi- 
path fading are not considered in this work for simplicity. Node 
i transmits with power Pi, where < Pi < Pmax, I < i < N, 
with Pmax being the maximum transmission power, and N 
being the number of nodes in the network. Power control is 
not considered in this work. 

MAC Layer: Let SINRj/i be a given threshold for the SINR 
requirement. Node i can transmit to node j when the SINR 

requirement is satisfied, i.e., SINR^ = — — 

> SINRt;,, where is the distance between nodes i and j, 
and Ni, is the noise power. We consider a scheduled resource 



The objectives are to achieve distributed configuration man- 
agement, i.e., to (a) form a desired physical topology, (b) 
schedule the resource utihzation at a given time to maximize 
the spatial channel-reuse with a desired SINR requirement, and 
(c) reconfigure upon failures by minimizing reconfiguration 
cost. 

B. Formulation 

Let P(<T, X) be a true probabilistic global model of a 
network configuration that results from the above assumptions. 

Definition 1: Optimal Configuration, (cr* , X*} is an optimal 
configuration if it maximizes the global likelihood. 



{(T*,X*)=arg max P{(t,X) 

((T.X) 



(1) 



Let P'(cr,X) be a local model that approximates the 
global model P(cr). 

Definition 2: Near-Optimal Configuration. Consider (&, X) 
that maximizes P\cr,X), i.e.. 



{a',X)=arg max P^{cr,X) 

{(T.X) 



(2) 



Consider an approximation error as the average relative 
difference between the log likelihoods. 



E[A] = E 



logPK,X*)-logP(^,X) 



logP(cr*,X* 



(3) 



where the expectation is over X, X*, a*, and a. For a given 
e > 0, if E[A] < e, the local model P\(T,X) and the 
corresponding realization (&,X) are near-optimal. 

Distributed configuration management requires that 
P'(cr,X) is factorizable, i.e., P'(cr,X)=ny .9y(cr,X), 
where gij{(T,X) is a localized probability density function 
that depends on variables in a neighborhood of nodes i, j and 
link for 1 < i, j < N. The global maximization from 
Eq.([T]) reduces to a set of coupled local maximizations, i.e., 
{&ij,Xi) = arg max(^_^ 9ij{(T,X) for 1 < i,j < N. 

Note that when variables exhibit spatial Markovian de- 
pendence, distributed configuration management is naturally 
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optimallTl. But when variables are non-Markovian, the near- 
optimaHty becomes important. Hence, E[A] provides a per- 
formance bound of distributed configuration management. If 
E[A] is less than a given error, the near-optimality can be 
achieved by a fully distributed configuration management 
based on co-operations of nodes in a small neighborhood. 

Our tasks are to 

(a) obtain a global model P(cr, X) from the above given 
assumptions; 

(b) characterize the spatial dependence of multiple variables 
{(T,X) using a graphical representation of P{cr,X). Obtain 
a simplified graph and a mathematical representation for 
P'(cr,X); 

(c) obtain sufficient conditions for P^{cr,X) to result in a 
near-optimal configuration; 

(d) derive a distribution algorithm, and apply the algorithm 
to self-configuration. 

Table H] summarizes key notations used in the paper. 



TABLE I 
Important Notations 



Q 


Power attenuation factor, 2 < a < 6 




Power of channel noise 


SINRij 




CTij 


Activity of link Oij G { — 1, 1}, r]ij=^ — 




Position of node i 


E{A) 


Approximation error of the local model 




An upper-bound of approximation error E{A) 


e 


Desired threshold of approximation error E{A) 


Z 


NoiTnalization constant 



III. Global Model 

We begin by developing a global model that characterizes 
probabilistic spatial dependence in a network configuration. 
Our approach is bottom-up so that the probabilistic model 
can be obtained faithfully based on the given assumptions and 
management constraints. 

A. Logical Configuration 

We begin with modeling a logical configuration given node 
positions. A logical configuration is considered as random, 
detailed explanations and an example can be found in 

1) Configuration Hamiltonian: We now develop a prob- 
abilistic model for logical configuration cr given a set of 
node positions X. We regard cr as a set of communication 
dipoles. Terminology "dipole" is originally used in a particle 
system in statistical physics. There, a dipole corresponds to 
a particle with binary states, active or inactive |fT9| . Now 
consider each "communication dipole" as a particle. Table / 
compares a wireless network with a particle system through 
their similarities. 



TABLE II 

Correspondence between Wireless (Dipole) Network and 
Particle Systems 



Wireless (Dipole) Network 


Particle Systems (Lattice Gas 119|) 


active(+l) / inactive(-l) 


occupied(+l) / empty(-l) 


interference 


interaction energy 


system potential energy 


chemical potential 


logical configuration 


system state (e.g., liquid or gas) 



Configuration Hamiltonian has been applied to a particle 
system to describe the states of a set of particles under 
the following conditions: (a) active particles are statistically 
distinguishable, and (b) interactions between particles are 
weak. 

We now extend the notion of configuration Hamiltonian to 
the wireless network, where active communication dipoles are 
statistically distinguishable; and interactions among dipoles 
are weak due to decaying interference. We define a system 
energy of a logical configuration as the summation of the 
received power at individual receivers in the network, 

where Pj denotes the net received-power at the receiver j 
by considering the signal component, interferences and noise. 
Note that for a hnk where node i is a transmitter and j 
is a receiver. Pi denotes the transmission power of transmitter 
i, and Pj denotes the received power at receiver j. We define 
the received power Pj to include the transmitted signal from 
the transmitter i and the interference from interferers with the 
opposite sign of the signal from transmitter i. Based on the 
assumptions in Section |II] for a single active dipole (7^ ~ 
1 in the network, the received power at the receiver Pj = 
Pil~j" "''2^"'" , where /y = jX; — A dipole is inactive, i.e., 
(Tjj = — 1 and Pj ~ 0, if node i does not transmit to node 
j. For multiple active dipoles, Pj has addition terms as the 
interference. 

Following the definitions in statistical physics lfT2l . the 
"configuration Hamiltonian" of a dipole system is the negative 
system energy |fT3l . 

H{(r\X) = - E P.^to +/3E(S™'^»J -S^NRt;,)^,?,,, (5) 

ij ij 

where 77^, = — , SINK,, = ^J- — is 

the SINR for dipole dy, SINRt,^ is a given SINR 
threshold, and f3 > is a weighting factor /3(SINRjj 
— SINRi/i)^ serves as a penalty term for the SINR 
constraint, an equivalence of which is used for simplicity, 
i.e., /3[P,/-/77.. - SINR,h(E™„^,, ^'^Tn"'?™ + Nt,,)f. 

For an active dipole (Ty = 1, the interference sources within 
a certain neighborhood from the receiver j are considered as 

'The use of a quadratic function as tlie constraint is for simplicity 
of derivation. A hard constraint is used in the distributed algorithm, i.e., 
PU(SlNRth - SINR,j) where U{x) = 1 for a; > 0; 0, otherwise. 
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the significant interferers; and this neighborhood is denoted 
by nJ^ as the interference range of node j. The Hamiltonian 
can be rewritten as 

H{(T\X)=R^{cT,X) + R2{cT,X)+R-i{cT,X) + Ri{cT,X), (6) 

where R\{(t,X) = Eij'^ij'?'J '■^^ first-order energy of 
individual dipoles, R2{ct,X) = E„„ e ",j.™„'?ij'7™ 

ij 

is the second-order energy with products of two dipoles 
within the interference range, Ra{rT,X) = Ei Em„ e jv^f 
e {n'.,n' } '^iJ,m,^.^^VijVmnVuv is the third-order energy 
with products of three dipoles within the interference range, 
Ri{a,X) = '^ij Riijic -.X) is the total interference outside 
the interference range where Ri-.{cr,X) is the residual in- 
terference outside the interference range of an active dipole 
CTy = 1. The coefficients of the link activities cr depend on 
relative node positions Z^ 's, where 



-PdT" + P ■ {P^l-" - SINRtftiVi,)' 



(7) 



-2p(Pil-/ - SmRthNt) ■ SmRthPJ-'], 

-/3(sinrLp™pj;;";;:/) 



= -2Vp^uUK 



Intuitively, ay corresponds to the increased power when 
dipole (Ty becomes active, aij,mn relates to the interference 
experienced by dy resulting from a neighboring active dipole 
(Jmn, and aij,rnn,uv relates to the interference experienced by 
(Ty from both a.mn and (Juv 



B. Physical Configuration 

Now consider node positions X. X is assumed to be 
random where the randomness originates from perturbed node 
locations, e.g., due to random movements from desired lo- 
cations and measurement noise. For example, a set of desired 
positions can be pre-determined as a management constraint to 
form a regular grid. But the actual node positions may deviate 
from their desired positions, resulting in an irregular topology. 

Management constraints are imposed on the physical con- 
nectivity. The 1 -connectivity is an example where there exists 
at least one connected path between any two nodes in the 
network to achieve the reachability of any source-destination 
pair. A Yao-graph provides a sufficient condition of the 1- 
connected physical topology, where each node has a connected 
Unk with its nearest neighbors every (< ^) radian apart 
). Such a constraint can be represented as 

h{x,,x,)^{ ^ , (8) 



0, 



Lij I , otherwise. 



where eo is a small positive constant, Lij=\XiQ — XjqI is the 
desired distance of lij, XiQ is a desired position of node i, 
which can be pre-determined as management objectives, de- 
sired positions, or the initial positions to minimize changes in 
positions, V (i, j). The resulting Hamiltonian for the physical 
topology is 



H{X) 



E 



(^^-^-')\^Y.Y.hiX.,X,), (9) 



^When Lij = 1^^ for a positive constant /jj, > 0, resulting in equally 
placed nodes and a regular Yao-graph. 



where is the set of the nearest neighbors of node i for 
every 9 radian, a is the variance which is assumed to be the 
same for all nodes for simplicity, and C is a positive weighting 
constant. 



C. Network Configuration 

We now consider a network configuration which consists of 
both a physical and a logical configuration. 

1) Network Configuration Hamiltonian: Combining the 
Hamiltonians from the physical and logical configurations 
results in an overall network-configuration Hamiltonian, 



i/(cr, X) = H{(t\X) + H{X). 



(10) 



2) Gibbs Distribution: A configuration Hamiltonian 
can be related to a probabilistic model through a Gibbs 
distribution ||T9l . Specifically, in a particle system lfT2l . 
the effective system potential energy H{lo), known as the 
configuration Hamiltonian, obeys the Gibbs (or Boltzmann) 
distribution 1121 . P{i-o) = Z^^- cxp^— ^|t^^, where is a 
multi-dimensional configuration-variable, Zq is a normalizing 
constant and T is the temperature of the particle system 

Qini- 

Model of logical configuration: For a logical configura- 
tion cr given node positions X, a Gibbs distribution P(cr|X) 
can be obtained using configuration Hamiltonian H{(t\X), 



Pia\X) 



■ exp 



-HirT\X) 
T 



(11) 



where Z^r ~J2cr ^ ^^^1^-* ^ is a normalizing constant 
and also called the partition function fT). T > is the 
temperature in statistical physics that characterizes the 
stability of the system HIQ. The lower the temperature, 
the more stable the configuration is. T is used in Q 
as a computational variable to obtain the most probable 
configuration (see Section IVll for details). 

Model of physical configuration: Similarly, the Gibbs 
distribution of node positions can be obtained as 



P{X) = Zx'^-exp 



-HjX) 
T 



(12) 



where Zx=J2x ^-^P ^ normalizing constant. 

Two-layer network model: The Gibbs distribution of an 
entire network configuration can be obtained using the overall 
configuration Hamiltonian, which is 



P(a,X) = Zo-.exp(^^(^ 



(13) 



where ^o=E(cr X) '^■^P i ^^'x''^^ ] is a normalizing 



constant. 
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3) Minimum Hamiltonian and Optimal Configuration: An 
optimal configuration maximizes the likelihood function, 

{(T*,X*) ^ arg max P((T,X) (14) 

(<T,X) 

= arg min H{cr,X), 

(<r,JC) 

where H{a,X) = - log [P{cr, X)] /T - log (Zq). Note the 
system energy H{(t, X) incorporates both the randomness and 
the external management requirements. When the constraints 
are satisfied, the penalty terms should be diminishing. There- 
fore, an optimal configuration should satisfy the management 
objectives of spatial reuse and the constraints. 

IV. Local Model 

We now seek a local model P'((T, X) that is factorizable as 
a product of localized probability density functions, and also a 
good approximation to the global model P(cr, X). We resort 
to probabilistic graphical models. 

A. Graphical Representation 

Probabilistic graphical models relate a probability 
distribution with a dependency graph of the corresponding 
random variables ifTl ITSl iflTl . A node in the graph represents 
a random variable and a link between two nodes characterizes 
their statistical dependence. In particular, a set of random 
variables v forms Gibbs Random Field (GRF) if it obeys a 
Gibbs distribution ||19j . Hammersley-Clifford theorem shows 
an equivalence between a probabilistic dependency graph and 
a Gibbs distribution. 

Hammersley-Chfford Theorem |fT9l: Let S = {!■ ■ ■ ,iV} 
be a set of nodes, and v be a set of random variables 
V ~ {wi, • • • , wjv}. V is said to be a Markov Random 
Field if (i) P(v) > for V v in sample space; (ii) 
P{vi\vj for j E S\{i}) = P{vi\vj for j E Ni), where Ni is 
a set of neighbors of node i for 1 < i < N. 

Random field v is also a Gibbs Random Field if its 
probability distribution can be written in a product fonn P(v) 
= Jlcec ^c(i')> where c is a clique, C is the set of all feasible 
cliques, and Vc{v) is a positive function. 




Fig. L Dependency Graph of Link Activities <t given Node Positions X 
for a line network. Contention range is given to be one liop and interference 
range is two hops. 

Hammersley-Clifford Theorem asserts that v is a Markov 
Random Field ;/ and only if the probability distribution P{v) 



follows a Gibbs distribution with conditional independence 
given neighbors. This theorem shows an interesting type of 
probabilistic graphical models where a random variable is 
conditionally independent of the others given its neighbors. 
In particular, when the neighborhood is much smaller than 
the size of a network, the conditional independence implies 
an interesting type of spatial Markov dependence, i.e., a node 
depends on its far neighbors through neighbors' neighbors. 
Such a nested dependence can be shown explicitly through 
local connectivities among nodes in a dependency graph. The 
resulting probability distribution is thus factorizable in terms 
of local probability distributions. 

For example, consider a one-dimensional physical topology, 
and the corresponding dependency graph for link activities 
cr given X shown in Figure [T] Nodes in the graph repre- 
sent binary random variables cr,;j's, and the links represent 
their spatial dependence. For example, solid lines show the 
dependence due to channel contention, the thin dashed lines 
indicate the dependence due to exact interference, and the 
thick dashed lines correspond to an approximation of the exact 
spatial dependence. The first and second rows of dipoles show 
the activity of bidirectional links. 

All communication dipoles are fully connected due to inter- 
ference, and thus exhibit non-Markovian spatial dependence. 
Such spatial dependence is represented by the Hamiltonian 
in Eq.® that consists of the products of all dipole-pairs. The 
fully connected dependency graph shows an uninteresting case 
of a random field where the neighborhood of a node is the 
entire network. This implies that obtaining an optimal config- 
uration would require each node to exchange information with 
all the other nodes in the network. 

B. Approximation 

To our knowledge, little has been done in the prior work 
on how to obtain an approximation of a fully connected 
dependency graph for near-optimal distributed configuration 
management. We observe that the interference outside the in- 
terference range, i?/(cr, X), can be relatively small compared 
to the first three terms of the configuration Hamiltonian in 
Eq.®. The third-order term, i?3(cr,X), can be small also 
compared to the second-order term. That is, in Eq.®, for 

an active link the third terms include l^^^'^l^-^j^'^lZj'^'^ 

where Imj » hj and luj >> Uf, whereas, the second term 

is only the product of lij^'^^l^'j^^ ■ Hence, if we use the first 
two terms to approximate the configuration Hamiltonian, we 
have 

and the corresponding Gibbs distribution is 

P'HX) = Zr^.cxp( '^y^) ), (16) 
where Zi is a normalization constant. 

As the sum in Eq.dTSll only involves neighboring dipoles which 
are within the interference range, the resulting dependency 
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graph now has a small neighborhood (see the thick dashed- 
lines in Figure [T]l. In fact this approximated Markov Random 
Field is the well-known second-order Ising model |fT9l where 
the Hamiltonian i/'(<T|X) consists of both the first- and 
second-order terms of cry 's. Such an approximation can also 
be obtained directly from the probabilistic dependency graph 
of the global model. That is, by removing all edges outside the 
interference range of each node, we can obtain the graphical 
representation of the local model. 

C. Spatial Dependence in Physical Topology 

We now examine the spatial dependence of node positions 
X. In general, spatial dependence in physical topology is not 
always Markovian since a general management objective can 
correspond to a fully connected graph. However, for many 
important and practical management objectives, a node only 
needs to interact with the close neighbors. Thus we also 
consider a physical topology that exhibits the second-order 
Markov dependence in this work. 

For example, under the 1 -connectivity constraint shown in 
Eq.®, node positions X correspond to a second-order Markov 
Random Field, where the interactions are only with the first- 
order neighbors. 

D. Two-layer Markov Random Fields 




V V - - J 1. 



Fig. 2. Two-layer Graph and Clique of (cr, X). The Dashed-Line Box: A 
Clique of the Two-layer Graph 

The probabilistic graph of an overall configuration can 
be obtained by combining two graphs for the logical and 
physical configuration. For the example network, a two-layer 
graph is shown in Figure |2] as an approximation of the 
original overall configuration. The upper layer graph is for 
the logical configuration that assumes the local interference 
among neighboring active dipoles. The lower-layer is for the 
physical configuration that shows Markov dependence of node 
positions due to the 1 -connectivity constraint in Eq.®. The 
entire graph thus exhibits spatial Markov dependence at both 
layers. 

This two-layer graph corresponds to a coupled MRF jT], 
where an Ising model and a second-order MRF are combined 
together. The graph is also known as a Random-Bond model 
||T9l . where dipoles are connected by random bonds which 



depend on node positions. The two-layer MRF (<t, X) can 
also be represented by a chain graph f2T| of two MRF layers, 
one for X and the other for cr. The two-layer probabilistic 
graph thus maps the complex spatial dependence of a multi- 
hop wireless network to an explicit graphical representation. 

The corresponding likelihood function can then be repre- 
sented as 

P'(<T,X)cx[]5y(*T,X), (17) 

where gij (cr, X) is a local probability density function and can 
be represented as a function of the sum of clique potentials: 

-1pc{(T,X) 



gij{(T,X) 



T 



exp 



T 



where dj is the set of all cliques including node i, node j, 
and link (i, j); and ipc{cr,X) is a clique potential function of 
a clique c. 

As an example, consider a clique for nodes 3 and 4 as well 
as link (3,4) shown in Figure |2] The corresponding potential 
is a collection of related clique functions, i.e.. 



EceC34 V'c(cr,X) = 

{Xi-XiiO)f 



{X3-x3{o)r 

2^ 



CJ2je{2,4} K^^i^j) 

CEje{3,5} ^^(^4, Xj) + a34 + Em„G<x„, 
a +1 

{ot2,i,mn + amnM) ' whcrC aj^l^ = {cri2,Cr21, 0-23,0-32, 

0-43, 0-45,0-54, 0-56,0-65} corresponds to the set of neighboring 
dipoles of 0-34 within the interference range. The solid line 
connecting 0-34 and 0-45 indicates the spatial dependence due 
to the channel contention. The dash line connecting 0-34 with 
either X^ or Xg indicates the dependence of dipole 0-34 with 
positions (X^Xe) of a neighboring dipole 0-56. In general, a 
clique is determined by the interference range. 

V. Analysis: Near-Optimality and Complexity 

We now derive near-optimality conditions for a local model 
to be a good approximation of the global model. The con- 
ditions can be obtained through the approximation error (Eq. 
(O), communication complexity, and their trade-offs. 

A. Communication Complexity 

The neighborhood size in a Markov Random Field corre- 
sponds to the range of information exchange of a node with its 
neighbors, and thus characterizes communication complexity. 
Specifically, the communication complexity of an active dipole 
can be regarded as the maximum number of active dipoles 
within its interference range. As such a maximum number 
is random and varies from dipoles to dipoles, we use a 
deterministic bound for the number of active dipoles. 

Assume that an active dipole o-^ satisfies the SINR re- 



quirement, i.e., SINRij = 



> SINR 



th. 



for 1 <,i,j,N, i ^ j. Consider a circle centered at the 
receiver Xj of the active dipole cr^ within which there cannot 
exist any active dipoles for the SINR requirement to hold. 
The radius of the circle is the contention range for node j. 
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Denote the minimum contention range for all active dipoles 
as Tc which is the minimum distance between any two active 
dipoles in the network. Only one dipole can be active within 
a contention range. Now consider interference range outside 
the contention range where multiple dipoles can be active 
concurrently, resulting in interference. We now bound the 
interference region using a circular region of radius rj. The 
region includes active dipoles outside Vc but within rj shown 
in Figure|3] Note that the actual interference range of a receiver 
may not be symmetrical in all directions but bounded by the 
circular region. This circular region is now considered as the 
relevant interference neighborhood for node j. 

By packing the circular region with small circles of radius 
Tc, we can obtain the maximum number of active dipoles in 
the interference neighborhood. 



•* 



I 'c 



a = 2 



2 

4 — a 



{-{VCrc)^ + (VCrc + VNlthrc)'^) |, else; 



Fig. 3. Contention range Tc and interference range rj of an active dipole 



With J = 21^1/ i^l^^ - ^l;^^lSINRth - Nb/Pt 

The proof is given in Appendix U The above analysis 
shows that a local model results in two components of 
the approximation error "Type-1 error'H is due to using 
a model of a lower order within interference regions, and 
bounded by the first terms in the above expressions. For a 
given communication complexity C, the Type-1 error can 
be made small by choosing a sufficiently large Tc- This 
would reduce the number of active dipoles as well as the 
dependence among the active dipoles. "Type-2 error" is 
due to neglecting the aggregated interference outside the 
interference range, and upper bounded by the remaining terms 
in the above expressions. Type-2 error can be made small if 
the aggregated interference outside the interference range is 
reduced. Specifically, as shown by the theorem, for a > 2 and 
a given Tc, communication complexity C needs to grow with 
respect to network size N for the error to be arbitrarily small. 
The growth rate depends on the power decay of the channel, 
since the aggregated interference outside the interference 
range depends on a. For a slow power decay, i.e., a — 2, the 
above result suggests that the Type-2 error is independent of 
C, and thus can only be reduced by a sufficiently large Tc- 



Definition 3: Communication Complexity C. The communi- 
cation complexity of a dipole atj is defined as the maximum 
number of active dipoles within the interference range, i.e., 
(^?,forl<i,j<N, i^j. 



B. Near-Optimality Conditions 

We now derive sufficient conditions for a local model to 
be a good approximation of the global model. We consider 
homogeneous networks for ease of analysis. 

Theorem 1: Consider a network where N nodes are uni- 
formly distributed, and satisfies the 1 -connectivity where the 
distance between any node and its neighbors for every is 
between lth{^ — eg) and Ithi^ + eo)> i^nd < lth{^ + eo) < '^c- 
Assume that N nodes transmit at the same power level 
(Pt > 0), have the same desired SINR threshold SINRth 
and the same circular interference range rf. Let a > 2 be 
the power attenuation factor of the channel. The average 
approximation error can be bounded as E[/\\ < e^, where 



Channel, contention and network size: Using the upper 
bound of the approximation error e^, we obtain a sufficient 
condition on the density of active dipoles for a large network 
so that the local model is near-optimal. 

Corollary 1 : Let e be a desired performance-bound. Assume 
N ^ 1, and C does not grow with respect to N. We have 
SA <e if 



0{N~ 



Oil), 



2 < Of < 4, 
a = 4, 
a > 4, 



where 0{) represents the order for large N. 

The proof can be obtained through simple algebraic 
manipulations from Theorem 1, and is thus omittec|3. 

Consider the above expression for large interference where 
2 < a < 4. This corresponds to such channel environments as 

'We use the Type-1 and Type 2 errors for convenience and they are different 
from those in hypothesis testing. 

^From Theorem 1, C is a function of r'c; thus, C is replaced with r^/r^. 
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free space a ~ 2, obstructed areas in factories a G {2 ^ 3}, 

4— g 

and urban areas a G {2.7 ~' 3.5}. Tc > 0(A^<'+° ) shows that 
Tc grows with N sublinearly. If communication complexity C 
remains constant when network size N increases, the maxi- 
mum number of activated dipoles is o{N ^3+^) = o{N^+^), 
showing a sparsely activated network. Hence, the correspond- 
ing Markov Random Field with a small neighborhood size 
may have insufficiently activated dipoles, and may not be an 
efficient approximation to the global modejf] 

Now consider small interference where a > 4. This 
corresponds to such channel environments as shadowed urban 
areas with a e {4 ~ 5}, and obstructed regions in buildings 
(a e {4 - 6}) Ell. When 4 < a < 6, > 0(1). This 
implies that Vc can grow with N at an arbitrarily slow rate 
for the approximation error to be small. The network has 
densely activated dipoles, i.e., the number of active nodes can 
be slightly fewer than 0{N), showing a densely activated 
network. This case shows that the Markov Random Field 
with a small neighborhood now has a sufficient number of 
activated dipoles, and is thus an efficient approximation to 
the global model. 

Topology: Note that for the uniform network assumed in 
the theorem, Iff^, the inter-distance between two neighboring 
nodes, characterizes the physical topology. Thus, Corollary 1 

1 /3 

shows that Vc increases with respect to for 2 < a < 4 
and Ith for a = 2. This shows that the rate of growth of the 
contention range with respect to the inter-node distance. 

Performance-complexity trade-off: We now examine how 
C can vary with respect to a for large N. 

Corollary 2: Assume iV 3> 1 (and C < N). can be 
simplified as 



I 

77 



Ai{Nlthrc)^ +o(), 



«()} 



77 



|2(2 



1 %2 



|ln(fe))} 



Ai + A2 



^ + Az-4^ + 0{N^), 



where Ai 



A -7- 

a 1 



4irl 



A. 



~2I 



Q = 2 

2 < a < 4 
a = 4 

a > 4; 

, 



The proof can be obtained by rewriting eA and simple al- 
gebraic manipulations, and thus omitted. The corollary shows 
that the larger the communication complexity C, the larger the 
Type-1 error, since more dependency is neglected by the local 
model within an inference range. In addition, a larger C should 
result in a smaller Type-2 error. However, for 2 < a < 4, 
the bound for the Type-2 error is nearly independent of 
C for large network size N. This suggests that the strong 
interference outside C dominates for a slow decaying channel, 
and a Markov Random Field may not efficiently approximate 

'Note that a lower bound would be needed for the approximation enor to 
draw a definitive negative conclusion. 



a global model. For a > 4, ca monotonically decreases as 
C increases, suggesting that the spatial dependence within an 
interference range dominates for a rapidly decaying channel. 
In fact, eA can be made arbitrarily small if C and Vc grow at a 
slow rate with respect to A^. Markov Random Fields are thus 
an efficient approximation of the global model. 

Figure |4] shows an example of eA as a function of C and 
a for SINRfft ^ 20, Nb = 0.1, rc = 10, rj e {20 - 100}, 
N = 1000, and kh 2 meteM The flat region in Figure |4] 
corresponds to small a (large interference), where the SINR 
requirement is violated and eA is truncated to remain constant 
for illustration. In contrast, eA is less than 10% for a = 4 and 
below 1% for a = 6 as shown in the figure. 

Figure |4] shows a trade-off between the approximation 
error and the communication complexity for a ^ 4. The 
intersection C and eA, e.g. between the two thick lines in the 
figure, corresponds to an optimal neighborhood with C=16. 
This corresponds to0 r/ = 40 and rc = 10. In general, for 
a given r/, rc can be adjusted to vary C so that a proper 
trade-off can be obtained. A wide range of 3 ~ 20 hops for 
Tc would be the most feasible scenarios. 




Fig. 4. Trade-off between performance and complexity. Thick dashed line: 
a=4. 



VI. Distributed Algorithm 

We now describe randomized distributed algorithms enabled 
by the local model for self-configuration where nodes make 
decisions using local information from neighbors. 

A. Distributed Algorithm 

The distributed algorithm obtains a near-optimal configura- 
tion by maximizing the approximated UkeUhood function. 



(6-,X) = arg max P'((T,X). 

(<T,X) 



(18) 



^ This corresponds to sensor networks for habitat monitoring, battlefield 
surveillance, and mechanical measurement and monitoring. 

^In general, an actual optimal value of rj depends on a constant that 
weights the relative importance of perfonnance and complexity. 
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Due to the spatial Markov property, P^{a,X) is factoriz- 
able over cliques. Maximizing the likelihood function reduces 
to maximizing the local probability density functions (clique 
potentials), i.e., for 1 < i, j < N , 

{(Jij,Xi) = Mg max P\a,j,Xi\XN^,aNi,), (19) 

{aij,Xi) 

where P\aij,Xi\XN,,(TN,j) = gij{cr, X), Xn, and ctj^i are 
in the neighborhood of node i and dipole o-y , respectively. 
These local probability density functions are composed of 
the neighboring nodes and dipoles, and the configuration 
can thus be updated locally. The local maximizations result 
in coupled equations due to the nested Markov dependence 
among random variables of cr and X, showing the need of 
information exchange among neighbors. 

Many algorithms can be used to maximize the local prob- 
ability density functions, e.g., stochastic relaxation QITI and 
message passing ifTTll . This work uses stochastic relaxation 
which is a randomized algorithm. The algorithm converges 
to the global maximum of P'(cr,X) asymptotically with 
probability one 13. 

In stochastic relaxation, each node makes local decisions 
based on a certain probability. That is, node i decides Xi and 
{cTy } for j e Ni, which results in a joint optimization of 
the topology and scheduling. Specifically, let Xi{t + 1) and 
(Jij{t + 1) denote the new position that node i would move 
to and the activity of link (i, j) at time t + 1, respectively. 
A sequential implementation of the distributed stochastic 
algorithm at a node i is described as follows; for i > 1 and 
l<i<N, 

(a) given neighbor positions X^. in the neighborhood Ni 
of node i, determine X.i{t + 1) = Xi, with probability 

p'ix,{t + i)^x^\XNM) - ■^^p(-^'(^')/^(*+i)) 



where ipi{xi) 



■ 



^^^^exp(-^.(x.)/r(t+l))' 

CEjeJV" h{xi,Xj{t)). 



(b) Given link activities aN-. and positions Xj\j. of nodes 
in the neighborhood of node i, determine whether node i 
transmits to node j for Vj G Ni, i.e., (Jij{t + 1) = aij with 
probability P' (dy (< + 1) = cr^ |XAr,(t), ctat., (*)) = 

cxp(-fc(..,)/T(.+l)) ^^^^^ ^^^^^^^ ^ 



Ev.,, cxp(-,Aij(fT,j)/T(t+l))' 



II \oii 
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received from neighbors, i.e., (X„j(t), o-,„„(i) = 1) for 
m ^ Ni, to update its own local configuration, Xi{t + 1) and 
{(Tij{t + 1)}. Here {(Tij{t + 1)} denotes the set of adjacent 
dipoles of node i. Meanwhile, a node that receives a message 
{Xi{t), (Tij{t)=l) also relays the message to its neighbors. 

Such information exchange bears a similar spirit to that of 
Bellman-Ford routing algorithm. Hence the near-optimal dis- 
tributed algorithm uses relative geo-location information and 
link activities from neighbors. Note that as an implementation 
issue, neighbor positions can also be sensed at a node to avoid 
information exchange among neighbors. 

VII. Self-Configuration: Example and Validation 

We now apply the distributed algorithm to examples of 
self-configuration of wireless networks for (a) 1 -connectivity 
formation of physical topology; (b) scheduling for a logical 
configuration; and (c) localized failure adaptation. We also 
compare the performance of the randomized distributed algo- 
rithm with the centralized algorithm from the global model 
and the commonly-used distributed protocol modelll6l 191 1221 . 

A. Example of Self-Configuration 

We simulate the distributed algorithm for self-configuration 
using the following parameters; Network size N ~ 100, the 
threshold of the inter-node distance Ith = 2 and ~ ^ for the 
1 -connectivity of the physical topology, a = 4 for the channel, 
and SINRf/i = 20. Topologies are randomly generated first. 
Figure [5] (a) shows an example of a random initial configura- 
tion (cro=0, Xq). Each node then updates its position based 
on the iterative statistical local rules using information from 
its neighbors as in Section IVI-AI The iteration stops when 
a steady state is reachecfl The dots in Figure |5] (b) show a 
resulting physical topology with 1 -connectivity. The physical 
topology then remains fixed during formation of a logical 
configuration. A logical configuration is obtained similarly by 
the probabilistic local rules in Section [VI-AI given the physical 
configuration. Figure |5](b) also illustrates the resulting logical 
configuration at the first time epoch. 



Temperature T is used as a cooling constant in the algo- 
rithm, where T{t) = ro/log(l + t) varies with time t and 
To=3 fl||19|. This allows an almost-sure convergence to the 
global minimum of the Hamiltonian (see Q for more details). 

Note that the joint configuration results directly from 
the distributed algorithm, i.e., given aNij and positions 
X^., {Xi{t + l),aij{t + 1)) = {xi,aij) with probability 
P'{iX,it + l), try it + l))^{x^, a,j)\XN^ {t) , aN,, (i)) ■ 

B. Information Exchange 

At time t, each node (e.g. node i) broadcasts its position and 
adjacent link status, {Xi{t), (Jij{t) = 1), to the neighboring 
active dipoles. At time t + 1, node i uses the information 



(a) Xo (b) <T given .X" 

Fig. 5. Self-configuration with localized algorithm 

Now, consider that a certain nodes fail in a network con- 
figuration shown in Figure |5] (b). Upon failures, the closest 



'^For instance, the condition of 
and i e TV®. 



< 0.01 is satisfied for Vi 
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(a) Random failure of nodes (b) Localized recovery of <t 

Fig. 6. Localized recovery from random node failures 

neighbors of failed nodes first adjust their positions and 
select other nodes for transmission. This may cause adjust- 
ments to the neighbors and neighbors' neighbors, resulting 
in cascading changes across the entire network. Hence small 
perturbations in a network can cause incessant changes to the 
entire network configuration. Thus localizing failure events 
is important. Additional penalty terms can be introduced to 
the Hamiltonian as reconfiguration costs to penalize cascading 
changes: ^ • |(cr,X) — {(7s,Xs)\, where {(Ts,Xs) denotes 
the steady-state network configuration, and ^ is a positive 
weighting constant that characterizes the cost of change in 
node positions and/or link activities. Such a constraint would 
localize the change. 

Figure|6](a) shows failures of wireless nodes that are marked 
as stars. Localized recovery of the physical topology is shown 
in Figure |6] (b). The nodes outside the arc are not affected 
by failures; the resulting configuration, however, is no longer 
globally optimal. The failed logical topology can be locally 
recovered using the same algorithm also shown in Figure |6] 
(b). 

Figure |7] shows the joint optimization of cr and X. The 
joint optimization is done at a node-dipole pair, i.e., {Xi, aij} 
for j E Ni first, and then moved on to another node-dipole 
pair {X„i,(T„in} for n E Nm, and so on. Compared to the 
sequential configuration in Figure |5](b), the joint optimization 
in Figure |7] increases the number of active dipoles by 30% 
(from 15 to 20). 
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Fig. 7. Joint Optimization of Link Activities and Node Positions 
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Fig. 8. Comparison of One-hop Capacity; Global, Local, and Protocol Model 

B. Model Validation 

We now validate the performance of the local model, 
communication complexity and the trade-offs. 

Performance: We first examine through simulation the 
performance of a local model compared with that of the global 
model and commonly-used protocol models in the prior work 

iiEiiia. 

A wireless network is generated with 50 randomly- 
positioned nodes, a = 4, and SINRt/i = 20. 10 simulations are 
conducted using random initial topologies and link activities, 
and the results are averaged. The interference range is the 
entire network for the global model, and chosen to be J'/=4 
for the local model. The protocol model assumes a separation 
(i.e., contention) range of without using an interference 
range. 

Figure [8] shows the one-hop capacity achieved by the global 
model, the local model, and the protocol model, respectively. 
For Ts large, the protocol model fails to maximize the spatial 
channel-reuse. This is because the contention constraint is too 
stringent. For small, the protocol model over-utilizes the 
channel resource, resulting in a violation of SINR constraint 
and a reduced spatial-reuse. Even at the optimal separation 
range (rs=3) the spatial-reuse of the protocol model is less 
than that of the local model by 23%. The inefficient spatial- 
reuse is due to the protocol model that cannot take a non- 
circular contention region into consideration. On the contrary, 
the global and local models characterize the interference 
and non-circular contention regions and thus represent the 
neighborhood system of dipoles more accurately. Compared to 
the global model, the local model has 8% less spatial reuse on 
the average, showing the performance degradation of a simpler 
model. 

We now examine the tightness of the bound on the approxi- 
mation error Specifically, the approximation error is measured 
from simulations and compared with that calculated from 
Theorem 1. A linear topology is selected, where 100 nodes are 
randomly placed and a node communicates with two neigh- 
bors. A linear topology is chosen as it provides a worst case 
of non-circular contention- and interference-regions. Other 
parameters used in the simulations are a = 4, SINRt/i=10, 
A^=100, rc=10, and a varying r/ to obtain values for C. 
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Results are shown in Figure |9l The measured approximation 
error decreases sharply as C increases and is indeed bounded 
by i?(A). The bound follows the same trend as the actual 
approximation error The difference between the measured 
approximation error and the bound may result from the linear 
topology whose density of active links is much sparser than the 
assumptions in Theorem 1. Intersections between C and -E[A] 
(or the bound) illustrate the performance-complexity trade-off. 




Fig. 9. Measured Approximation Error and Upper Bound -E{A) 

Complexity: We now compare the communication 
complexity of a local and a global model through simulations. 
We use the same parameters and the number of runs in the 
simulation. The actual communication complexity is obtained 
by counting the number of the neighboring active links within 
the interference range of each active dipole, and averaged 
over all active dipoles and 10 runs. Figure [TO] shows the 
communication complexity for both the global and local 
model as a function of network size N . The communication 
complexity of the centralized global optimization increases 
linearly with N since each node uses the information from all 
other nodes in the network. The complexity increases slowly 
with N for the distributed algorithm. 



communicaiton links = BO, cycle length = 32 time-slots 

6r 




Fig. 11. Timeslot Allocation of Spatial Time Division Multiple Access 
(STDMA) 

prior work |j24l that shows optimal scheduling decisions can be 
done in a fully distributed fashion. [24] considers packet-level 
scheduling, the queue size and packet arrivals but a simplified 
interference model. Our model does not include packet arrivals 
and queue occupancy but uses a general interference model 
and adapts physical topology also. To compare these two 
works at a common ground, we assume that there is one 
queue for each communication link and a queue is always 
busy, and a physical topology is given. Our local probabilistic 
model is extended directly to include multiple time-slots in the 
coefficients and details are given in Appendix HIl We conduct 
a simulation for scheduling using the following parameters: 
a = 4, N= 25, lth=2, and rf=6 for regular topology in 
Figure [TT| There are 80 links and 32 assigned time-slots. The 
figure shows that the optimal allocations are obtained in these 
examples by the local model and the distributed algorithm 
while satisfying the SINR constraints. Hence, compared with 
[24] when a physical topology is given, an advantage of the 
local model is its ability to approximate the general (non- 
Markovian) SINR model. A disadvantage of our local model 
is a lack of specifications of network traffic. 



VIII. Conclusion 




Number of Nodes N 



Fig. 10. Communication complexity C vs network size N. P and P': global 
and local model, respectively. 

Extension: We now consider an extension to multiple time- 
slots to show the promise of the distributed algorithm to 
scheduling in a more realistic setting. This is motivated by the 



We have developed an analytical framework in which near- 
optimality, approximation and randomized distributed algo- 
rithms can be studied for self-configuration, i.e., joint con- 
figuration of a topology and scheduling. Our findings are 
summarized as follows. 

(a) We begin with a global model that characterizes the 
ground truth in regard to network assumptions. The ground 
truth includes the randomness from a network internally and 
management constraints imposed externally. The resulting 
model is a Gibbs distribution where the exponent can be 
regarded as a cost function. This relates modeling with opti- 
mization so that both randomness in a network and objectives 
can be combined naturally. This approach differs from the pure 
"emerging behavior" where nodal decisions are not governed 
by an optimal performance. This approach also differs from 
the "black-box" method where models are learned externally. 

A disadvantage of our approach is the simple assumptions 
that limit the current model. Fading has not been included 
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in our model. Traffic demands at network-layer have been 
considered in developing a probabilistic graphical model in 
ETl but not yet in this work. 

(b) The mathematical representation of the global model 
quantifies the statistical spatial dependence of node positions 
and node-node communication. The complex spatial depen- 
dence is represented explicitly by a probabilistic graph. The 
graph shows how to obtain a local model to approximate 
the global model. The local model is a two-layer Markov 
Random Field or a random bond model. The complexity of 
the local model is the communication range of nodes from 
neighborhoods in the Markov Random Field. We remove 
long links in the graph to obtain a local model, and a more 
sophisticated approach may be explored to obtain a better 
approximation. 

(c) We have derived sufficient conditions on the near- 
optimality of the local model under different channel con- 
ditions, density of nodes, network size and complexity. The 
conditions show a trade-off between the near-optimality and 
complexity. For example, a local model is near-optimal with 
a moderate complexity if a channel has a fast power decay 
of order at least 4; and may not be near-optimal, otherwise. 
The near-optimality conditions thus complement the empirical 
choices in protocol designs (e.g. 802.11), and may provide a 
practical utility in regard to the effect of channel attenuation, 
densities of nodes, and node-pair communications. 

(d) Near-optimal local models render a class of random- 
ized distributed algorithms for self-configuration. The algo- 
rithm allows each node to adapt probabilistically its local 
configuration using only information from neighbors. Local 
self-configuration collectively achieves a near-optimal global 
configuration. Also, the distributed algorithm achieves a near- 
optimal configuration at a bounded communication complex- 
ity. Hence the algorithmic advantage is a strength of the local 
model. We have shown examples of stochastic scheduling and 
reconfiguration upon failures, and compared the performance 
and complexity with existing protocol models. 

One disadvantage of stochastic relaxation is the slow con- 
vergence that has been explored in the prior work 1101 . Other 
simple randomized algorithms have been discussed in ||20l . It 
would be beneficial to study simpler randomized algorithms. 

Appendix 
I. Proof of Theorem 1 

Proof: We begin proving the theorem by bounding the error, i.e., 
^ H(a'iX) maiX) ^^ |H(g»|X) H(g|J^)r where the super- 

scripts u and / denote an upper and a lower bound of the 
corresponding quantity. 

To find an upper bound of the numerator, we have \H{(t*\X) — 
H{&\X)\ = \{H{a''\X)-H'{cr*\X)) + {H'{a\X))-H{a\X) + 
{H'{a*\X)-H'{a\X))\ < \H{a''\X)-H'{(T*\X)\ + \H{a\X)- 
H\a-\X)\, where the inequality holds since H{fT*\X) < 
H{&\X), and H\a\X) < H\fT*\X) by definition. 

For any configuration (it\X), \H{(t\X) - < 
\Ri{cr\X)\+ \R3{a\X)\. Let and J/f be an upper bound 
of |i?3(cr|X)| and |_R/((t|X)|, respectively, where 



i 11 



\Rsi<T\X)\ < ^ }^2Ptk,- k, 



k-i =1 fc2=l 



a = 2 



< 2Pt\^l + {J k — dk)j 
2Ptr-"(l + 0.51nC)^ 
^(a-2(C)^f, a>2 

= /3, 

where "1" appears in the above expression to ensure that the discrete 
sum upper bounds the integral. The penalty term in the configuration 
Hamiltonian is not included as I3 is an upper bound. 

We now find Ir. Consider an active dipole tJij, and let Gk be the 
set of neighboring active dipoles that are Vf + kvc apart from the 
receiver. The cardinality of Gk is upper bounded, i.e., \Gh\ < ^ = 

^/sin-i (77T(fcFr) < Mrf + (k - l)r.)/r,. 

For an active dipole aij, let Itu be an upper bound of the residual 
interference outside interference rangfl i.e.. 



— ^ Tc 



where kjj is an integer that satisfies the inequality 



rc ~ ^ Vc 

fe=i 



(20) 



The value 2)i»i(i+(;o) ^jgjjQjg^ upper bound of the 
maximum number of available active dipoles except (Ty (with 
two nodes) in a network with total N node s, kjj ca n be solved 
from the above inequality as kij < rc + ^/NlthTc- Using this 
bound for ku and replacing r/ by we have 



/ + / (r; + (fc-l)r,)- 



-dk 



rc(«th(l + eo))^ 

, ^ + ln(l + WNlth/iCr,)) , a = 4 

a^^^ (C^r^ + j^- 

= Ir. 

Thus, IHicr^lX) - H{a\X)\ < \H{a\X) - H'{a\X)\ + 
\H((t''\X) - H'icr'lX)] < 2{lR, + h)N„*, where N„* is total 
number of active dipoles in cr* given X. 

To obtain a lower bound for |i/((T*|X)|, we take only the 
net energy term in H{cr*\X) as the penalty term is usually 
small, i.e., close to zero when the constraint is satisfied. 
Then \H{<T*\X)\ > min{P,/-/- E„.n^., PtlZ } 

'Similai' to ^3, the penalty term is not included for deiiving an upper bound. 
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for V (Tij — 1. For CTy 



1, to satisfy a given SINRf/j, a 



sufficient condition is (Pt(E™„/.,^^)' + Nt)/iPt{ll 
< l/SlNRt^, thus ° ~ 

Therefore, \H{rT*\X)\ > jPtl-^" {I + eo) 



Y < ( iJ 



Pt ■ 



Hence, i ^i" i^j^^ioi^n i < ^^ere = 



(1-eo) — /SINR,,-^, 



Assume eo=0 for a simple representation, 

^ ((2 + inc)' + ^(rc + ^/M;;:7:)) 



a = 2 



I / 2(2- 
T I ' 



1 ,2 

T> I 47r / 1 
3 + 72-(. 



ln(l + 



0) 



2(ci-2C" 



(a-2)2r; 



^ i-a y 



Where 1= 2/ [IZ i^Z " ^Khl^^^^th - N^/Pt) 

Moreover, since the bounds derived above hold for 
any {a, X), they also holds for the expected value, i.e., 

II. Model Extension to Multiple Timeslots 

As the slot allocation is based on the spatial dependence among 
neighboring nodes and links, we extend the local model and the dis- 
tributed algorithm to spatial time division multiple access (STDMA). 
We assign a time slot Sij to a dipole 1 < Sij < Smax, where 

Smax is the TDMA cycle length. Since Smax is unknown, we then 
generalize the coefficients in the Hamiltonian in l[6j as functions of 
time slots, 

= aij ■ LOsiSij), (21) 

where Sij is the time-slot number of TDMA cycle for dipole 
and 5{Sij, Smn)=l if Sij = 5mn; 5{Sij,Smn)=0, otherwise. 
iJsiSij) is a function that is inverse-proportional to Sij, and LJs{Sij) 
= g!— is used in this work. This enables early sequences of timeslots 
to play an more important role in minimizing the potential energy. 
For example, the first timeslot is utilized first and so on. As a result, 
the length of STDMA cycle Smax is minimized. 

The resulting MRF model is now defined over multiple orthogonal 
time slots, i.e. Smax time slots, and a communication dipole (Jij now 
takes an extended form of Uij ■ 5{Sij — s), where 5{Sij — s)=l if 
Sij=s\ 0, otherwise, for 1 < s < Smax- 
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